Processing apparatus, image pickup apparatus, image pickup system, and processing method

ABSTRACT

A processing apparatus combines a plurality of images based on a plurality of object images formed on an imaging plane of an image sensor by a plurality of lens units and to generate a combined image, and includes at least one processor or circuit that serves as an acquisition task configured to acquire information on a center position of each of the plurality of object images on the imaging plane, information on a correspondence relationship between the center position and positions of the plurality of images in the combined image, and conversion information for converting a first coordinate system in the imaging plane into a second coordinate system in the combined image, the conversion information being generated based on a correction function for correcting the plurality of object images, and a processing task configured to generate the combined image using the conversion information.

CROSS REFERENCE TO RELATED APPLICATIONS

This application is a continuation of U.S. patent application Ser. No. 17/320,407, filed on May 14, 2021, which claims the benefit of and priority to Japanese Patent Application No. 2020-097907, filed Jun. 4, 2020, each of which is hereby incorporated by reference herein in their entirety.

BACKGROUND OF THE INVENTION Field of the Invention

The present invention relates to a processing apparatus, an image pickup apparatus, an image pickup system, and a processing method.

Description of the Related Art

A method of generating a combined image has conventionally been known by acquiring a plurality of images different from each other in single imaging using an optical system that forms a plurality of images of the same object with a plurality of lenses, and by staking (superimposing) the acquired plurality of images. Japanese Patent No. 5910739 discloses an image pickup apparatus that includes a filter array and a lens array after the objective lens, and can simultaneously acquire a plurality of images for generating a multiband image with a common image sensor.

In the image pickup apparatus disclosed in JP 5910739, a plurality of lens units in the lens array image light having different wavelengths from each other, and cause the distortion in a plurality of spectroscopic images and consequently a shift in positional information among the plurality of spectroscopic images. A correct result cannot be obtained by a spectroscopic analysis that uses the multiband image generated by stacking the plurality of spectroscopic images in which the positional information shifts.

SUMMARY OF THE INVENTION

The present invention provides a processing apparatus, an image pickup apparatus, an image pickup system, and a processing method, each of which can generate a good combined image.

A processing apparatus according to one aspect of the present invention is configured to combine a plurality of images based on a plurality of object images formed on an imaging plane of an image sensor by a plurality of lens units and to generate a combined image. The processing apparatus includes at least one processor or circuit configured to execute a plurality of tasks including an acquisition task configured to acquire information on a center position of each of the plurality of object images on the imaging plane, information on a correspondence relationship between the center position and positions of the plurality of images in the combined image, and conversion information for converting a first coordinate system in the imaging plane into a second coordinate system in the combined image, the conversion information being generated based on a correction function for correcting the plurality of object images, and a processing task configured to generate the combined image using the conversion information. At least one processor or circuit is configured to perform a function of at least one of the units. An image pickup apparatus and an image pickup system including the above processing apparatus and a processing method for a processing apparatus corresponding to the above processing apparatus also constitute another aspect of the present invention.

Further features of the present invention will become apparent from the following description of exemplary embodiments with reference to the attached drawings.

BRIEF DESCRIPTION OF THE DRAWINGS

FIGS. 1A to 1D schematically illustrate an image pickup system according to a first embodiment.

FIGS. 2A and 2B explain a distortion correcting method according to the first embodiment.

FIGS. 3A to 3C schematically illustrate a data structure of a multiband image.

FIG. 4 is a flowchart showing a method of converting an image acquired in the first embodiment into a multiband image.

FIG. 5 schematically illustrates an image pickup system according to a third embodiment.

FIG. 6 explains an eccentric distortion.

DESCRIPTION OF THE EMBODIMENTS

Referring now to the accompanying drawings, a detailed description will be given of embodiments according to the present invention. Corresponding elements in respective figures will be designated by the same reference numerals, and a duplicate description thereof will be omitted.

First Embodiment

FIGS. 1A to 1D schematically illustrate an image pickup system according to this embodiment. FIG. 1A is a side view of the image pickup system, and FIG. 1B is a front view of the image pickup system viewed from the object side. The image pickup system includes an optical system, an image sensor SS, and a processor 100. The optical system includes a filter array FA (omitted in FIG. 1B) and a lens array LA. The image pickup system may include an optical apparatus including the optical system, and an image pickup apparatus that includes the image sensor SS and the processor 100 and is mountable with the optical apparatus.

As illustrated in FIG. 1C, the processor 100 includes an acquisition unit (acquisition task) 100 a and a processing unit (processing task) 100 b. The processor 100 may further include a memory (not shown) for storing information acquired by the acquisition unit 100 a and information generated (processed) by the processing unit 100 b. Although the processor 100 is installed in the image pickup system in this embodiment, it may be configured as a processing apparatus separate from the image pickup system. The processor 100 may be installed in the image pickup apparatus or optical apparatus.

The lens array LA includes a 2×2 array lens unit (imaging unit) ML11, ML12, ML21, and ML22. This embodiment sets 2×2 lens units to the lens array LA for simple explanation, but the present invention is not limited to this example. Even when a plurality of lenses are disposed in an array, it serves as the lens array LA.

Since each lens unit is disposed and focused on the image sensor SS, the same object image is imaged (formed) on the imaging plane of the image sensor SS by the number of lens units. Since a plurality of object images are formed in a tile shape according to the arrangement of the lens units, the images formed by the lens units will be referred to as “tile images” in the following description. The tile image is developed around an intersection as a center position between the optical axis of the corresponding lens unit and the image sensor SS.

The filter array FA is disposed on the optical axis of the corresponding lens unit, and includes a plurality of bandpass filters having transmission characteristics different from each other. Making the transmission characteristics of the plurality of bandpass filters different from each other can develop a plurality of tile images of the same object and light having different wavelengths on the imaging plane. In other words, the image pickup system according to this embodiment can acquire a plurality of spectral images at once (simultaneously). Since the number of tile images (number of bands) is proportional to the number of bandpass filters and lens units, the number of bandpass filters and lens units may be increased or decreased according to the required number of bands.

The processor 100 generates a multiband image (combined image) by virtually stacking (superimposing or combining) a plurality of (spectroscopic) images based on a plurality of tile images acquired from the image sensor SS. A colored image acquired by a normal camera is generated by superimposing three-layer (R, G, B) spectroscopic images (two-dimensional luminance map) containing different color (spectral wavelength) information. On the other hand, the multiband image is generated by stacking spectroscopic images of more than three layers. An interval in the layer direction (wavelength direction) (increment of the spectral wavelength) corresponds to a wavelength resolution, and the number of layers (overall width of the wavelength) corresponds to a wavelength range. Although it depends on the data structure, a regular interval of the spectral wavelength and the spectral images arranged in descending or ascending order of the spectral wavelengths can provide a result easier to use for the analysis. The spectroscopic images of important spectral wavelengths may be arranged in front of the data to accelerate the analysis, or an irregular interval of the spectral wavelength may be used.

Given information on a correspondence between information on the center position of the tile image (center position information) and spectral wavelength information (layer position of the spectroscopic image in the multiband image), the multiband image stacked in the wavelength order follows. This embodiment stores, as stack information, the information on the correspondence between the center position information and the spectral wavelength information.

The configuration illustrated in FIG. 1A may need focusing of the lens array LA depending on the object distance. FIG. 1D illustrates a modified example of the image pickup system. In FIG. 1C, a focusing lens L1 having a larger aperture is disposed on the object side of the lens array LA. When an exit pupil of the lens L1 includes entrance pupils of all lens units in the lens array LA, an imaging relationship of the optical system is maintained. If this condition is satisfied, focusing can be achieved by simply disposing a lens having a short focal length while the lens array LA is fixed relative to the image sensor SS, particularly in imaging a close object.

The distortion of the lens unit will now be described. When the plurality of lens units have the same design, each image has a chromatic aberration when the plurality of lens units image light that has transmitted through different bandpass filters. In the chromatic aberrations, the lateral chromatic aberration is particularly observed as if the image height is expanded or contracted in comparison with each tile image. In other words, the lateral chromatic aberration viewed in the tile image is observed like the distortion, and image processing (image correction) cannot distinguish between the lateral chromatic aberration and the distortion. Therefore, the lateral chromatic aberration and the distortion in the (two-dimensional) spectroscopic imaging are collectively treated as the distortion. The multiband image can be generated by simply stacking a plurality of spectroscopic images acquired by the imaging sensor SS, but the distortion for each tile image causes a shift in positional information among the spectroscopic images. This shift appears as a color shift on the multiband image and may deteriorate the image quality of the multiband image. Therefore, even when the multiband image is used for the spectroscopic analysis, a correct result cannot be obtained.

A description will now be given of a method for correcting the distortion. FIGS. 2A and 2B explain the distortion correcting method according to this embodiment. FIG. 2A illustrates a distortion correcting chart (referred to as a grid chart hereinafter) in which grid points (dots) are arranged in a grid pattern. FIG. 2B schematically illustrates tile images formed on the imaging plane when the grid chart is imaged by the image pickup system according to this embodiment.

As illustrated in FIG. 2B, the tile images corresponding to the lens units ML11, ML12, ML21, and ML22 are formed in sections T11, T12, T21, and T22 shown by the broken lines on the imaging plane, respectively. For better understanding, a distortion amount is exaggerated, and the grid points in the grid chart image are connected by lines. The center position of the tile image is expressed by a symbol X. The distortion correction using a grid chart is realized by searching for a set of a correction function and a coefficient in which the positions of the grid points in the grid chart image are orthogonally returned. In order to correct the distortion of the tile image, this embodiment divides the tile image from the entire image acquired by the image sensor SS, and generates a set of the correction function and the coefficient in the coordinate system based on the center position of the tile image. The number of sets of the correction function and the coefficient is as many as the number of tile images. The information (array) obtained by the correction function and the coefficient indicates to which coordinate the original coordinate in the tile image including the distortion should be corrected. Information Y_(t) on the position of the corrected tile image in the tile coordinate system in the tile image is expressed by the following equation (1):

Y _(t) =F _(t)(X _(t) −X _(ct))  (1)

where t is the tile image number (tile number), X_(t) is information on the position of the tile image in the sensor coordinate system (first coordinate system) in the imaging plane, X_(ct) is information on the center position of the tile image in the sensor coordinate system, and F_(t) is the correction function (including the coefficient) for correcting the tile image.

Given the information X_(ct) on the center position of the tile image and the correction function F_(t) as a result of the preliminary calibration, the information Y_(t) on the position of the corrected tile image can be obtained. In the subsequent imaging, the distortion is corrected for each tile image based on the information Y_(t) on the position of the corrected tile image, and the multiband image is generated by superimposing images based on the corrected tile images. Since the number of divisions of the tile image and the number of distortion correcting calculations increase in proportion to the number of tile images, the multiband image generating speed decreases. When a motion image is generated using the multiband image and the generating speed of a single multiband image (1 frame) decreases, the frame rate decreases.

A description will now be given of the distortion correcting method that restrains the multiband image generating speed from lowering even if the number of tile images increases. FIGS. 3A to 3C schematically illustrate a data structure of the multiband image. When images based on two-dimensional tile images are stacked in order of wavelength, the multiband image can be expressed by a three-dimensional data structure using an x-axis and a y-axis for representing the spectroscopic image and a λ-axis for representing the wavelength, as illustrated in FIG. 3A. This three-dimensional data structure will be referred to as a “data cube.” FIG. 3A illustrates an ideal data cube. In the data cube illustrated in FIG. 3A, the distortion and the like are sufficiently corrected, and the positional information is correct (or is not shifted). Each pixel value (luminance) in the multiband image discretely exists on the three-dimensional coordinate grid point. This is similarly applied to any of the x-axis, y-axis, and λ-axis, and it can be interpreted that only the number of divisions (resolution) is different.

The expression (1) is an operation formula that converts the information X_(t) on the position of the tile image in the sensor coordinate system into the information Y_(t) on the position of the corrected tile image in the tile coordinate system. In the expression (1), the correction function F_(t) corresponds to a correction coefficient, and the information X_(ct) on the position of the center position of the tile image in the sensor coordinate system corresponds to an offset. FIGS. 3B and 3C illustrate pre-correction and post-correction tile images, respectively. A multiband image is generated by stacking images based on the corrected tile image in the wavelength order. Internal processing corresponds to mapping all pixels on the image based on the corrected tile image onto the three-dimensional image. This mapping can be realized given the correspondence information between the tile number t in the expression (1) and the coordinate (layer position) on the λ-axis. In summary, the destination (address on the data cube) of each pixel value on the imaging plane is determined once the correction function is determined. Information Z on the position in the data cube coordinate system is expressed by the following expression (2):

Z=GF(X−X _(c))  (2)

where X is information on the position of the tile image in the sensor coordinate system, X_(c) is information on the center position of the tile image in the sensor coordinate system, F is a matrix representing the distortion correction (coordinate transformation) of the tile image, and G is a matrix for converting the tile coordinate system into the data cube coordinate system (second coordinate system) in the multiband image.

The expression (2) can be used to convert the information X on the position of the tile image in the sensor coordinate system into the information Z on the position in the data cube coordinate system without using the tile number t. Since the matrices F and G are linear transformation matrices, the expression (2) can be turned into the following expression (3):

Z=H(X−X _(c))  (3)

where H (conversion information) is a matrix that collectively expresses the matrices F and G for converting one sensor coordinate system into the data cube coordinate system.

By using the expression (3), the image acquired by a single calculation can be divided into tile images and expanded on the data cube. Since only the calculation for the sensor coordinate system is required, a calculation amount is constant without being affected by the increase or decrease of the number of tile images. By acquiring the matrix H and the information X_(c) generated during the calibration or the information X and Z, the acquired image can be converted into a multiband image at a high speed. Since the above calculation is a linear transformation, it can be easily incorporated into a parallel computer (GPU or FPGA).

FIG. 4 is a flowchart showing a method of converting an image acquired in this embodiment into a multiband image.

In the step S101, the processor 100 (acquisition unit 100 a) acquires an image made by capturing a grid chart image from the image sensor SS. In this embodiment, the grid chart illustrated in FIG. 2A is used.

In the step S102, the processor 100 (acquisition unit 100 a) acquires information on the positions of the grid points of the grid chart in the sensor coordinate system. The information on the positions of the grid points is distortion information. In this embodiment, a dot is disposed at the grid point, but its type is not limited as long as distortion information can be obtained. A dot coordinate can be obtained at a relatively high speed using a Hough transform or the like.

In the step S103, the processor 100 (acquisition unit 100 a) acquires information on the center position of the tile image in the sensor coordinate system. The processor 100 may calculate the information on the center position of the tile image using the information on the positions of the grid points for each division, or may acquire the information from another means. The grid chart may be captured so as to fill the angle of view of each tile image, and the center coordinate of the grid chart may be substituted for information on the center position of the tile image. This embodiment uses a grid chart consisting of 9×9 grid points, and thus the center coordinate of the grid point.

In the step S104, the processor 100 (acquisition unit 100 a) acquires stack information which is information on the correspondence between the information on the center position of the tile image in the sensor coordinate system and the spectral wavelength information (layer position in the spectroscopic image in the multiband image).

In the step S105, the processor 100 (acquisition unit 100 a) acquires a correction function (including a coefficient) for correcting the tile image.

In the step S106, the processor 100 (acquisition unit 100 a) acquires the matrix H as the conversion information used to convert into the data cube coordinate system the sensor coordinate system generated based on the information on the center position of the tile image in the sensor coordinate system, the stack information, and the correction function. The processing unit 100 b may generate the matrix H.

In the step S107, the processor 100 (processing unit 100 b) generates a multiband image using the object image acquired from the image sensor SS, and the matrix H.

In this embodiment, the distortion information of the optical system is acquired by capturing the grid chart with an actual machine, but may be acquired by using a simulation or a design value. If the manufacturing error of the optical system can be sufficiently reduced, the distortion map calculated for each tile image may be used as information on the positions of the grid points for each tile image. As long as the information on the center position of the tile image in the sensor coordinate system can be accurately acquired, the information generated from the optical design value of the lens unit may be used as the information on the positions of the grid points in the tile coordinate system.

As described above, the configuration of this embodiment can acquire a good combined image (multiband image) at a high speed.

Second Embodiment

A basic configuration of the image pickup system according to this embodiment is the same as that of the first embodiment. This embodiment will discuss only the differences from the first embodiment.

The configuration according to this embodiment simultaneously corrects a luminance decrease around the object image or so-called shading. This embodiment corrects shading based on not only vignetting of the optical system and the sensitivity characteristic of the image sensor SS relative to the light incident angle, but also the transmission characteristic of the bandpass filter relative to the incident angle.

The simplest shading correction is a method for capturing a white object on the entire surface and for generating a luminance correcting map (correction information) backwardly calculated so that the in-plane luminance distribution of each tile image becomes constant. This embodiment uses the filter array FA including different types of bandpass filters, and thus the white color here may be white (reflectance is constant within the use wavelength) over the entire use wavelength. The luminance correcting map may not be divided for each tile image and may be used in the form of the sensor coordinate system. When the pixel value is coordinate-converted using the expression (3), the luminance correcting map is simultaneously coordinate-converted and thereby shading is simultaneously corrected. Before the distortion correction (coordinate conversion) is performed, the shading may be corrected on the entire image previously acquired from the image sensor SS by using the luminance correcting map. This method only performs a simple multiplication, has a small calculation load, and is not affected by an increase or decrease of the number of tile images.

As described above, the configuration according to this embodiment can correct shading at a high speed, in addition to the effect of the first embodiment.

Third Embodiment

The first embodiment has discussed a method of correcting the chromatic aberration generated in the lens array, but this embodiment will discuss a method of correcting different aberrations.

FIG. 5 schematically illustrates an image pickup system according to this embodiment. The image pickup system includes an optical system, an image sensor SS, and a processor 100. The optical system includes a lens L1, a filter array FA (omitted in FIG. 5 for simple explanation), and a lens array LA.

When a common optical system such as the lens L1 (referred to as a “main optical system” hereinafter) is disposed on the object side of the lens array LA, optical axes (labelled as AX1 and AX2 in FIG. 5) of the lens units in the lens array LA shift from an optical axis (main optical axis) AX0 of the main optical system. Thus, many lens units in the lens array LA have different eccentric distortions depending on the distance and direction from the main optical axis AX0. In an attempt to correct an eccentric distortion that differs for each tile image, a calculation load will increase because it is necessary to calculate a displacement in the information on all the positions in the tile image, and a large memory capacity is necessary to store the information on the distorted position.

This embodiment reduces a data amount from the viewpoint of geometrical optics. A shift amount (eccentric component) of the lens unit in the vertical direction from the main optical axis AX0 is considered as a vector and is defined as an “eccentric amount ε” When the eccentricity c is 0, that is, when the optical axis of the lens unit is located on the main optical axis AX0, the distortion of the tile image captured by the lens unit becomes symmetrical with respect to the optical axis, so the correction function can be described with a single variable as an angle of view ω. The distortion of the tile image formed by the lens unit when the eccentricity is 0 will be referred to as a “reference distortion.”

It is known that the eccentric distortion is proportional to the product of the eccentric amount ε and the square of the angle of view ω. FIG. 6 explains the eccentric distortion. In FIG. 6, the displacement amount is exaggerated. The image is deformed (eccentrically distorted) by the amount and direction determined by the eccentric amount ε. The magnitude of the deformation amount can be expressed by a coefficient. In summary, the total distortion of the tile image formed by the lens unit disposed at the eccentric position can be approximated by the product of the reference distortion component and the eccentric distortion component. A tile image correction function f_(di) is expressed by the following expression (4):

F _(di)(x _(i) ,y _(i))=f ₁(ω_(i))×f ₂(ε_(i))  (4)

where i is a tile number, x_(i) and y_(i) represent information on the corrected position in the tile image in the sensor coordinate system, ω_(i) is an angle of view in the tile image, and ε_(i) is an eccentric amount of the lens unit from the main optical axis corresponding to the tile image, f₁ is a function including a reference distortion component (angle of view component), and f₂ is a function including an eccentric distortion component (shift amount component).

Backwardly calculating and assigning the coefficients suitable for the functions f1 and f2 can generate a correction function for the eccentric distortion of the tile image. Thus, the correction amount of the reference distortion is previously calculated and stored as one-dimensional data of the angle of view ω, and thereby the distortion correction amount of the tile image can be generated by a simple calculation.

As described above, the configuration according to this embodiment can correct an eccentric distortion with a small data amount.

Fourth Embodiment

As described in the first embodiment, the lateral chromatic aberration generated in the lens unit in the lens array LA can be collectively treated as the distortion in the image processing, and thus can be corrected by the same function as that of the distortion correction. The lateral chromatic aberration is generally expressed by the difference from the image height of the reference wavelength. The correction function of the tile image is expressed with a function f_(ci) representing a distortion component of the tile image at a reference wavelength where an arbitrary tile image is imaged at the reference wavelength, and a function f₃ representing a distortion component (deformed component) caused by the lateral chromatic aberration of the spectral wavelength. The distortion correction function f_(di) of the tile image is expressed by the following expression (5):

f _(di)(x _(i) ,y _(i))=f _(ci)(x _(0i) ,y _(0i),λ₀)×f ₃(λ_(i))  (5)

where i is a tile number, x_(i) and y_(i) include information on post-correction position in the tile image in the sensor coordinate system, x_(0i) and y_(0i) include information on pre-correction position in the tile image in the sensor coordinate system, λ₀ is the reference wavelength, and λ_(i) is an imaging wavelength of the tile image.

Since the distortion component caused by the lateral chromatic aberration can be treated independently of the eccentric distortion in the third embodiment, the distortion correction function f_(i)(x_(i), y_(i)) is expressed by the following expression (6) in combination with the third embodiment.

f _(di)(x _(i) ,y _(i))=f ₁(ω_(i))×f ₂(ε_(i))×f ₃(λ_(i))  (6)

where f_(di) is a distortion correction function of the tile image (i), x_(i) and y_(i) include information on post-correction position in the tile image (i) in the sensor coordinate system, f₁ is a function representing a reference distortion component, ω_(i) is an angle of view in the tile image, f₂ is a function representing an eccentric distortion component caused by an eccentric amount ε_(i), and ε_(i) is an eccentric amount (shift amount) [mm] from the main optical axis in the vertical direction.

As described above, the configuration according to this embodiment can correct the lateral chromatic aberration with a small data amount.

Each embodiment corrects the distortion generated by the filter array FA, but the present invention is applicable to a correction of the distortion generated by the lens array LA. For example, when the lens units in the lens array LA are manufactured with different designs, the distortions generated in the plurality of tile images are different from each other.

The above embodiment can provide a processing apparatus, an image pickup apparatus, an image pickup system, and a processing method, each of which can generate a good combined image.

Other Embodiments

Embodiment(s) of the present invention can also be realized by a computer of a system or apparatus that reads out and executes computer executable instructions (e.g., one or more programs) recorded on a storage medium (which may also be referred to more fully as a ‘non-transitory computer-readable storage medium’) to perform the functions of one or more of the above-described embodiment(s) and/or that includes one or more circuits (e.g., application specific integrated circuit (ASIC)) for performing the functions of one or more of the above-described embodiment(s), and by a method performed by the computer of the system or apparatus by, for example, reading out and executing the computer executable instructions from the storage medium to perform the functions of one or more of the above-described embodiment(s) and/or controlling the one or more circuits to perform the functions of one or more of the above-described embodiment(s). The computer may comprise one or more processors (e.g., central processing unit (CPU), micro processing unit (MPU)) and may include a network of separate computers or separate processors to read out and execute the computer executable instructions. The computer executable instructions may be provided to the computer, for example, from a network or the storage medium. The storage medium may include, for example, one or more of a hard disk, a random-access memory (RAM), a read only memory (ROM), a storage of distributed computing systems, an optical disk (such as a compact disc (CD), digital versatile disc (DVD), or Blu-ray Disc (BD)™), a flash memory device, a memory card, and the like.

While the present invention has been described with reference to exemplary embodiments, it is to be understood that the invention is not limited to the disclosed exemplary embodiments. The scope of the following claims is to be accorded the broadest interpretation so as to encompass all such modifications and equivalent structures and functions.

This application claims the benefit of Japanese Patent Application No. 2020-097907, filed on Jun. 4, 2020 which is hereby incorporated by reference herein in its entirety. 

1-10. (canceled)
 11. A processing apparatus configured to generate a combined image by stacking a plurality of images based on a plurality of object images formed on an imaging plane of an image sensor by a plurality of lens units, the processing apparatus comprising: at least one processor or circuit configured to execute a plurality of tasks including: an acquisition task configured to acquire conversion information for converting a first coordinate system in the imaging plane into a second coordinate system in the combined image, the conversion information being generated based on information on a center position of each of the plurality of object images on the imaging plane, information on a correspondence relationship between the center positions and layer positions of the plurality of images in the combined image, and a correction function for correcting the plurality of object images; and a processing task configured to generate the combined image using the conversion information.
 12. The processing apparatus according to claim 11, wherein the acquisition task acquires correction information for correcting an in-plane luminance distribution of an image corresponding to each of the plurality of object images, and wherein the processing task corrects an image corresponding to each of the plurality of object images using the correction information.
 13. The processing apparatus according to claim 11, further comprising a memory configured to store the conversion information.
 14. An image pickup apparatus comprising: the processing apparatus according to claim 11; and an image sensor configured to generate the plurality of images.
 15. The image pickup system comprising: the image pickup apparatus according to claim 14; and an optical system that includes a plurality of lens units.
 16. The image pickup system according to claim 15, wherein the optical system includes a plurality of filters disposed on optical axes of the plurality of lens units and having transmission characteristics different from each other.
 17. The image pickup system according to claim 16, wherein the correction function includes a function including a deformation component caused by a lateral chromatic aberration of each of the plurality of object images, and a function including a distortion component of each of the plurality of object images at a reference wavelength.
 18. The image pickup system according to claim 15, wherein the optical system includes a main optical system on an object side of the plurality of lens units.
 19. The image pickup system according to claim 18, wherein the correction function includes a function including an angle of view component of each of the plurality of object images, and a function including a component of a shift amount between an optical axis of the main optical system and an optical axis of each of the plurality of lens units.
 20. A processing method for a processing apparatus configured to generate a combined image by stacking a plurality of images based on a plurality of object images formed on an imaging plane of an image sensor by a plurality of lens units, the processing method comprising the steps of: acquiring conversion information for converting a first coordinate system in the imaging plane into a second coordinate system in the combined image, the conversion information being generated based on information on a center position of each of the plurality of object images on the imaging plane, information on a correspondence relationship between the center positions and layer positions of the plurality of images in the combined image, and a correction function for correcting the plurality of object images; and generating the combined image using the conversion information. 